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ABSTRACT 

Connecting galaxies with their descendants (or progenitors) at different redshifts can 
yield strong constraints on galaxy evolution. Observational studies have historically 
selected samples of galaxies using a physical quantity, such as stellar mass, either 
above a constant limit or at a constant cumulative number density. Investigation 
into the efficacy of these selection methods has not been fully explored. Using a set 
of four semi-analytical models based on the output of the Millennium Simulation, 
we find that selecting galaxies at a constant number density (in the range —4.3 < 
log n [Mpc~ 3 h 3 } < —3.0) is superior to a constant stellar mass selected sample, 
although it still has significant limitations. Recovery of the average stellar mass, stellar 
mass density and average star-formation rate is highly dependent on the choice of 
number density but can all be recovered to within < 50% at the commonly employed 
choice of log n [Mpc -3 h 3 } = —4.0, corresponding to log M Q /h ~ 11.2 at z = 0, but 
this increases at lower mass limits. We show that there is a large scatter between the 
location of a given galaxy in a rank ordering based on stellar mass between different 
redshifts. We find that the inferred velocity dispersion may be a better tracer of galaxy 
properties, although further investigation is warranted into simulating this property. 
Finally, we find that over large redshift ranges selection at a constant number density 
is more effective in tracing the progenitors of modern galaxies than vice-versa. 
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1 INTRODUCTION 

In the now commonly accepted paradigm, galaxies form in 
the gravitational wells of collapsed cold dark matter halos, 
which themselves are seeded by primordial quantum fluctu¬ 
ations in the Universe’s first moments. In this hierarchical 
picture of galaxy formation and evolution, galaxies build up 
stellar mass through both in-situ star-formation and galaxy 
mergers, where more massive galaxies merge with smaller 
systems which result in more massive descendants. Over 
time, these processes produce the array of galaxies and the 
environments within which they are observed. In order to 
understand how galaxies form and evolve, the evolution in 
their properties (e.g. stellar mass, size) must be observed. As 
the most massive, and thus the brightest, galaxies are the 
easiest systems to observe out to high redshift, it is these 
galaxies and their properties that observations attempt to 
study. This has typically been achieved by selecting galaxies 
in two ways. 

* E-mail: carl.mundy@nottingham.ac.uk 


Historically, selecting galaxies across a redshift range 
of interest using a constant stellar mass cut has been used 
to study the evolution of the most massi ve galaxies (e.g. 
IConselice et al.ll2003l : iMortlock et. al.lEoU 'l. Use of this se¬ 
lection method intrinsically assumes galaxies have more or 
less been a passively evolving population from high redshift. 
However, processes such as major galaxy mergers and bursts 
of star-formation interfere with these assumptions and con¬ 
taminate the selection by changing the rank order of galax¬ 
ies. Thus, the wider the redshift range this method is applied 
to, the less accurately it may trace the galaxies of the orig¬ 
inal selection. 

Selecting galaxies at a constant cumulative comoving 
number density, when ranked by some physical property 
such as stellar mass or luminosity, has proven a popular al¬ 
ternative in the recent literature whe n observin g both field 
and cluster galaxy evolution fe.g. lLin et al ,2m3). Using this 
technique, the averaged star-formation history of a constant 
number density selected sample of galaxies over the redshift 
range 3 < z < 8 has been shown to be able to account for th e 
stellar mass growth of these galaxies dPapovich et alJl2010h . 
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The average stellar mass of the most massive galaxies (those 
with logM* > 11.0 at z = 0, and logn [Mpc~ 3 ] = —4.0) has 
been found to increase by a factor of ~ 4 over the redshift 
range 0.3 < 2 < 3. However, the integrated star-formation 
history appears unable to account for the growth in stellar 
mass at 2 < 1.5. Therefore, the influence of both major and 
minor galaxy merger s is required to acco unt for this discrep¬ 
ancy at low redshift (lOwnsworth et aPl2014|j . 

Furthermore, studies have observed the evolution in 
Ha equivalent width, structural properties and colours of 
galaxy populations selected at various num ber densities 
IIFumagalli et al.ll2012l : iMarchesini et alJl2014lf . Stellar mass 
measured inside a radius of r — 5kpc on stacked images 
of massive galaxies (selected at logn [Mpc -3 ] = —3.7 cor¬ 
responding to galaxies with log M* [Mq] > 11.4 and 
log M* [Mq] > 11.1 at 2 = 0.1 and 2 = 2.0 respec¬ 
tively) is found to be approximately constant over the red¬ 
shift range 0.6 < z < 2.0. On the other hand, the stellar 
mass content be yond this radius is foun d to increase by 
a fact or of ~ 4 ( van Dokkum et all l20ld ) . IConselice et al.l 
(2013) compare the derived gas fractions of massive galaxies 
(M* > 10 11 Mq) with their star-formation histories in the 
redshift range 1.5 < 2 < 3, selecting galaxies at a merger- 
adjusted constant number density. They conclude that gas 
accretion is the dominant source of observed stellar mass 
production for these galaxies over this redshift range. 

Investigation into the efficacy of either selection method 
has not been fully explored. Num erical calculations pre¬ 
sented in Ivan Dokkum et ah Sioic]) suggest the influence of 
galaxy mergers has little influence on stellar mass growth 
when measured using a cons tant number density selected 
sample. [Papovich et al.l ll2010l l used dark mat ter halo merger 
trees from the Millennium Simulat iorB (MS; ISpringel et al.l 
120051 : iLemson fe Consortium! l2006l ! to show that the recov¬ 
ery fractio n of descendan t halos at redshifts 3 < 2 < 8 
is ~ 50%. iBehroozi et al.l (I 2 OI 3 I) found a small change in 
the cumulative number density of the most massive (A/» > 
10 11,5 Mq) 2 = 0 progenitor galaxies of +0.22 dex per unit 
A 2 . Furthermore, they find that this change and thus the 
mass histories of descend a nts an d progeni tors are different . 
More directly, iLeia et all (l2013l l used the iGuo et all J2011L 
Gil) semi-analytical model (SAM) applied to the MS in 
order to test the validity of the underlying assumptions of 
constant number density selection. They find that, within 
this particular SAM, the median stellar mass of descendant 
galaxies can be recovered over the redshift range 0 < 2 < 3 
to within ~ 40% of the true value. Corrections for stellar 
mass growth rate scatter, galaxy mergers and quenching are 
found to reduce this discrepancy to within ~ 12% - well 
within typical observational error attributed to the calcula¬ 
tion of stellar masses. These results, however, are model and 
cosmology dependent and are sensitive to the dark matter 
merger trees and the recipes used to determine galaxy prop¬ 
erties. How the stellar mass is calculated is a prime example. 
Different methods of calculating this may introduce different 
levels of scatter into the rank order of galaxies across red- 
shifts. Sensibly investigating the efficacy of these techniques 
requires a mixture of SAMs, merger trees and cosmology to 
gauge the amount of variance in the results. 


1 http:// tow. mpa-garching.mpg.de/galf orm/virgo/millennium/ 


While stellar mass is used in the studies mentioned 
previously, it may not be the most appropriate property 
with which to rank and select galaxies in order to trace 
their properties. Increasing evidence has suggested that the 
central velocity dispersion of a galaxy is a good predictor 
of galaxy properties, including star-formation rate (SFR) 
and colour across large redshift ranges. Furthermore, it is 
thought to be a more stable quanti ty with redshift com¬ 
pared to, for exa mple, stellar mass ( Bezanson et al.ll2012l : 
IWake et al.ll20121 ) , partly due to the weak dependence of ve¬ 
locity dispersion on both stellar mass and galaxy size, with 
a oc (M*/ 7 ? e ) 2 . ILeia et all (I 2 OI 3 I ) briefly investigated the 
change in velocity dispersion for descendants of 2 = 3 galax¬ 
ies selected at a constant cumulative number density. They 
found a small change (< 0.15 dex in logn) in the average in¬ 
ferred velocity dispersion from 0 < 2 < 3 in the Gil SAM. 
Similarly, si mulations o f mass ive galaxies (log M* > 10.8) 
presented in lOser et ahl d2012|j find an increase in velocity 
dispersion of A a = 0.2 dex over the redshift range from 
2 = 2 to 2 = 0, consistent w ith o bservatio nal estimates 
(A g.jjavier Cenarro fe Trunlloll2009l ; iMartinez-Manso et al.l 

[2011). These observations warrant a detailed study into the 
use of inferred velocity dispersion in place of stellar mass. 

To generate an accurate framework for how galaxies 
form and evolve, one must observe the evolution of their 
properties over time. With these observations, models can 
be crafted to explain them. If, however, the evolution is 
not traced correctly, these frameworks can deviate from the 
truth. Although the aforementioned literature works pro¬ 
vide some arguments to support the use of their selection 
methods, no study has attempted to quantify the recovery 
efficiency of these methods. To this end, we study in de¬ 
tail the ability of these selection methods to trace individ¬ 
ual galaxies, as well as their stellar mass and star-formation 
properties over cosmic time. 

In this paper we use models of galaxy evolution to in¬ 
vestigate the ability of galaxy selections which are a) above 
a constant stellar mass limit; b) at a constant cumulative co¬ 
moving number density in stellar mass; c) above a constant 
stellar velocity dispersion limit; and d) at a constant cumula¬ 
tive comoving number density in stellar velocity dispersion. 
We test how well these methods trace the true evolution of 
progenitor and descendant populations initially selected at 
redshifts 2 = 0 and 2 = 3 respectively. 

This paper is constructed in the following way. In §2 we 
discuss the data and the metrics we use to test the efficacy 
of the observational selection methods. In §3 we present the 
results and analysis of our work when selecting galaxies us¬ 
ing their stellar mass. In §4 we present results when selecting 
galaxies using their inferred stellar velocity dispersions. Fi¬ 
nally, we discuss and conclude our main results in §5 and 
§6. We describe the simulations and associated cosmologies 
we use in section sun 


2 DATA & SELECTION METHODS 

In this paper, we determine the efficacy of two different 
methods in recovering the direct (i.e. most massive) pro¬ 
genitors and descendants of the most massive galaxies from 
initial selections at redshifts of 2 = 0 and 2 = 3, respectively. 
The first selection method is at a constant limit (in either 
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stellar mass or velocity dispersion), above which galaxies are 
selected. The second selection is at a constant cumulative 
comoving number density. This is achieved by integrating 
the galaxy stellar mass function (GSMF), or the galaxy ve¬ 
locity dispersion function (GVDF) if velocity dispersion is 
used, to obtain the integrated number density as a function 
of stellar mass (or velocity dispersion). From this we obtain 
the stellar mass limit above which all the galaxies are below 
a certain number density. The sample we examine at that 
redshift thus contains all galaxies with a stellar mass greater 
than this value. Additionally, we want to quantify how well 
each selection method recovers both the average and sum to¬ 
tal stellar mass in the descendant or progenitor populations, 
as well as the average SFR. The combination of cosmologi¬ 
cal dark matter simulations and semi-analytical recipes, as 
well as cosmological hydrodynamical simulations, continue 
to provide the only environments in which to conduct such 
an investigation. 


2.1 Simulated Data 

To test this, we utilise the output of the Millennium Simu¬ 
lation (MS) and the catalogues of four SAMs applied to it 
and its variants. The original simulation consists of 2160 3 
dark matter particles of mass 8.6 x 10 s h -1 M@ within a 
comoving box of size 500 h -1 Mpc on a side. The MS uses 
a ACDM cosmological model with = 0.25, fib = 0.045, 
h = 0.73, 12 a = 0.75, n s = 1 and as = 0.9, where the Hubble 
constant is param etrized as Ho = 100 h km s -1 Mpc^ 1 . The 
I Guo et all (I2013i . G13) SAM utilises a subsequent simulation 
which follows 2160 3 particles of mass 9.3 x 10 s h _1 Mq us¬ 
ing an updated WMAP7 llKomatsu et al.ll201ll l cosmology 
with fl m = 0.272, fib = 0.0455, h = 0.704, fl A = 0.728, 
n s = 0.967 and as = 0.81. The main difference between 
these two simulations is the value of the linear power spec¬ 
trum amplitude on scales of 8 h _1 Mpc, ag, which affects the 
dark matter halo merger r ate, and thus the gala xy merger 
rate , inside the s i mulat ion dConselice et alj|2014h . 

iBower et al.l (120061 . B06) presents an updated variant 
of the Durham s emi-analytical model of galaxy formation 
(ICole et all 120001 1 in which the treatment of active galac¬ 
tic nuclei (AGN) and stellar feedback on halo quenching is 
improved. They find that these updated treatments reduce 
the number densities of higher mass galaxies and remove 
cooling flows from rich clusters. iDe Lucia &; Blaizod l|2006l. 
D06) applied their model to the output of the MS with up¬ 
dated treatments for stellar populations, dust attenuation 
and cooling flow suppression via AGN feedback. They find 
that supernovae and AGN feedback processes play a vital 
role in the early quenching of star-formation in the progeni¬ 
tors of local brightest cluster galaxies (BCGs). Gil describe 
an updated model of galaxy formation and evolution with 
new recipes for supernovae feedback and galaxy bulge sizes 
among others. They find that the simulated abundance of 
massive galaxies, with log M* [Mq] > 11.0, are consistent 
with observations out to z ~ 1.0. However, they over predict 
galaxies of lower stellar mass beyond z ~ 0.6 and under pre¬ 
dict massive galaxies at 0 > 1.0 by at least an order of mag¬ 
nitude (see Fig. 23 in Gil). Finally, G13 describe the results 
of implementing their SAM in a WMAP7 cosmology. They 
find a requirement for weaker feedback and star-formation 
efficiency than a WMAP1 cosmology in order to reproduce 


the observed local GS MF. Mer ger trees used by B06 are 
described in lHarker et al.l d2006fl wh ile those employed b y 
the remaining SAMs are presented in TSpringel et al.l (1200514 . 
It is these models based upon these merger trees extracted 
from the Millennium Simulation from which we study the 
observational selection methods. 

Physically motivated semi-anal ytical models_of 

galaxy formation ( see, f or example, I Bower et al.l 120061 : 
Vogclsbcr ger et al.l I2014T ) applied to cosmological dark 
matter simulations, such as the MS, provide an unparalleled 
tool to probe the evolution of dark matter halos and the 
galaxies that reside within them. Simulations and observa¬ 
tions at low redshift (z < 2) are found to be consistent in 
many respects (e.g. luminosity functions), however SAMs 
have varying degrees of success in matching observational 
quantities beyond this. Comparison of different SAMs and 
other models show simulated galaxy stellar mass functions 
are generally consistent with most observations out to 
z ~ 2 if feedb ack mechanisms from AGN and sup ernovae 
are included dCroton et al.l 120061 : iLu et all l2013l l. This 
agreement extends to comparison between simulated and 
observe d major mergers for the mos t massive galaxies inside 
the MS dBertone fe Conselicell2009r ). but not for lower mass 
systems. 


2.2 Selection Method Metrics 

It is prudent to measure how accurately each selection 
method samples the progenitors or descendants of the galaxy 
population being studied. In this work, descendants of an 
initial z = 3 selection are identified by following the ‘de- 
scendantld’ property in the SAM output catalogues. At each 
step, duplicate descendants (due to mergers between two or 
more galaxies) are removed such that the number of true 
descendants decreases with time. On the other hand, pro¬ 
genitors of an initial z = 0 selection are defined as the most 
massive galaxy in the previous redshift snapshot that came 
to be the galaxy in the current snapshot. These definitions 
allow traversing of different branches along merger trees, de¬ 
pending on the direction we take. In this work we measure 
the ability of each selection method to recover galaxy prop¬ 
erties using various metrics. 

Firstly, the recovery fraction quantifies how many of the 
available progenitors or descendants are recovered at differ¬ 
ent redshifts in the sample obtained using a given selection 
method such that 

free = Ns / Ntot, (1) 

where N s is the number of descendants/progenitors included 
in the observational selection, and N to t is the total number 
of descendants/progenitors available to be selected. 

Although helpful, it may not strictly be necessary to 
sample the descendants or progenitors of interest to re¬ 
produce the true evolution of galaxy properties - sampling 
different galaxies from the true progenitors or descendants 
might be sufficient if the galaxies replacing those lost have 
similar properties. Therefore a low recovery fraction may not 
necessarily correspond to an inability to recover the true evo¬ 
lution in, for example, average stellar mass or SFR. Because 
of this, we also consider the fraction of the observed sample 
that is not a galaxy of interest, and call this the contamina¬ 
tion fraction, defined as 
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/contain = (N se \ — N„) / N e e 1 (2) 

where N se i is the number of galaxies within the observed 
selection and N s is the number of true (i.e. most massive) 
descendants/progenitors included in the selection. In this 
paper we also compare the true mean stellar mass, m* Iue , of 
the progenitors or descendants to that observed using each 
selection method, m* bs , defined as 

«m* = (m* bs - m* Iue )/rhtr ue = A m*/fr^ Iue . (3) 

In a similar fashion, the ability to trace the evolution of the 
stellar mass density, or sum of the stellar mass, is important 
to understand the build up of stellar mass in galaxies over 
time. We compare the observational selection techniques’ 
abilities to return the true mean stellar mass density, quan¬ 
tified as 

k p * = A(E m*)/E tritrae (4) 

where (E to*) is the sum of stellar masses. Finally, we extend 
this to the average SFR of the galaxies in a similar fashion 
such that the discrepancy between the true and observed is 
defined as 

km>* = A$* / $ t * rue (5) 

where H/* is the mean SFR. We choose to study these galaxy 
properties in particular because they are the most funda¬ 
mental, and the most used in the literature thus far. Ad¬ 
ditionally, we investigate whether each selection method is 
best applied to tracing progenitor or descendant galaxy pop¬ 
ulations, i.e. whether the selection methods are best applied 
forwards or backwards in time. 


2.3 Velocity Dispersion Selection 


As central velocity dispersion has been shown to exhibit a 
shallow evolution over time, it is prudent to investigate this 
physical property as a tracer. From scalar virial theory the 
stellar velocity dispersion of a system can be estimated by 


2 


a 


GM * 
5Ri 


(6) 


where G is Newton’s gravitational constant, A/* is the to¬ 
tal _stellar mass and R is the half-mass radius (see, e.g., 
ICappellari et al .1 120061 ) . Using the reported t o tal an d bulge 
stellar masses and sizes, we follow iLeia et all ( 2013 ) in esti¬ 
mating the half-mass radius of each simulated galaxy as 

r , MbRb + MdRd 

R h ~ Mb + Md ■ (7) 

As before, M is stellar mass, R is the half-mass radius and 
subscripts b and d correspond to the bulge and disk com¬ 
ponents respectively. We correct the disk scale radius pro¬ 
vided in the SAM catalogues to convert it to a half-mass 
radius such that Rd = 1.678J? sca ie- This relation is ideally 
obtained num erically, how ever approximations are available 
for a rang e of. Scrsic ( 196 ^) i n dices . We refer the interested 
reader to I Graham fe Driver] 120051 1 and references therein 
for more information. 

We calculate these metrics using both stellar mass and 
velocity dispersion at four constant number density selec¬ 
tions and four constant limits. Number density values are 


chosen to cover a wide range, allowing comparison with pre¬ 
vious work, and to be representative of what number densi¬ 
ties are currently applicable to observational studies. Con¬ 
stant limits in both stellar mass and velocity dispersion are 
chosen to enable comparison with the number density selec¬ 
tions. Constant stellar mass limits are defined as the mass 
limit of a number density selection at either a = 0 or z = 3, 
depending on whether progenitors or descendants are being 
investigated. In short, the limits are chosen such that the 
initial selections, whether at a = 0 or z = 3, are the same. 
In ij3]we report our findings using stellar mass as the rank¬ 
ing property and in fJ4]we report the results of using inferred 
velocity dispersion. 


3 STELLAR MASS SELECTIONS 

We investigate the ability of two different galaxy selection 
methods, using two different galaxy properties, to recover 
the mean stellar mass, stellar mass density, mean velocity 
dispersion and average SFR of progenitor and descendant 
populations. All our results are available to download on- 

lirnfl 


3.1 Descendants 

In Figure [T] we show the results of selecting descendants at 
four constant stellar mass limits, described in Table [T] This 
is defined as the stellar mass limit for a number density se¬ 
lection beginning at z = 3 and examining evolution at lower 
redshift. As one might expect, all constant stellar mass se¬ 
lections recovered all available descendant galaxies at every 
redshift (top panel), as galaxies typically experience a net 
gain in stellar mass over these redshift ranges. However, the 
fraction of the selected sample that are not descendants of 
interest, /contain, increases to ~ 50% by z = 2. By z = 0, 
samples selected above each stellar mass limit are almost 
completely contamination. This shows that using a constant 
mass cut at 2 = 3 selects essentially none of the same galax¬ 
ies (descendants) at lower redshift. 

Contrary to what this metric might suggest, the differ¬ 
ence between the observed and true mean stellar mass is 
underestimated by only 50% by 2 = 0, decreasing linearly 
at lower redshift. This seems to be largely independent of the 
stellar mass limit in all but the B06 SAM which fares rela¬ 
tively better at higher limits. Recovery of the median stellar 
mass is indistinguishable from the mean stellar mass for the 
two smallest number densities. At the two largest choices 
however, the median stellar mass is further underestimated 
towards lower redshift such that at 2 = 0, this property is 
underestimated by ~ 60%. The stellar mass density is over¬ 
estimated by a factor of ~ 4 (~ 20) at the lowest (highest) 
mass limit in all SAMs by 2 = 0. As the B06 SAM does not 
report SFRs in its catalogues, we instead consider only the 
remaining three SAMs in recovery of the mean SFR. At the 
lowest stellar mass limit, this is recovered to within ~ 10% 
down to 2 = 1. At lower redshifts, however, the SFR be¬ 
gins to be increasingly underestimated and by z = 0 it is 
underestimated by ~ 50%. 


2 http://goo.gl/X6QZGC 
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Figure 1. Recovery fraction of individual descendants, their av¬ 
erage stellar mass, stellar mass density and average SFR (rows) 
for four constant stellar mass limit selections (columns) covering 
the stellar mass range at 2 : = 3 of log M* > 10.7,10.6,10.4,10.2. 
Initial stellar mass selection limits for each SAM are given in 
the thir d col umn of Table [T] SAMs used are IBower et al.l (|2006l . 
B06 h iD e Lucia &; Blaizotl (I 2 OO 6 I , D06h [Guo et alj <12011 . Gil) 
and I Guo et al.l d 20131 , G13), represented by solid green, dashed or¬ 
ange, dashed-dotted magenta and dotted blue lines respectively. 
(A colour version of this figure is available in the online journal.) 


Figure 2. Recovery fraction of individual descendants, their av¬ 
erage stellar mass, stellar mass density and average SFR (rows) 
for four constant number density selections (columns) covering 
the number density range —4.3 < log n < —3.0. These ap¬ 
proximately correspond to stellar masses at 2 : = 3 of log M* > 
10.7,10.6,10.4,10.2. Initial stellar mass selection limits for each 
SA M are given i n the third column of Table [ll SAM s used 
are IBower et ah d2006l B0 6h ID e Lucia Sz Blaizotl d 20061 , D06), 
IGuo et alj d2011 . Gil) and I Guo et al.l d 20131 , G13), represented 
by solid green, dashed orange, dashed-dotted magenta and dotted 
blue lines respectively. (A colour version of this figure is available 
in the online journal.) 


We show in Figure 0 that by using a constant num¬ 
ber density selection the recovery fraction, / rec , decreases 
exponentially with decreasing redshift such that by 2 = 0 
between 30% and 60% of the available descendants are se¬ 
lected. The contamination fraction is found to vary between 
half and three quarters of the sample at the highest and 
lowest number densities respectively. Mean stellar mass is 
overestimated by a factor that increases with both number 
density and redshift, overestimating the true value by a fac¬ 
tor of 1.3 and 1.6 by z = 0. While not plotted, the median 
stellar mass is qualitatively similar above 2 ~ 0.5. Below 
this, the median stellar mass is further overestimated by up 
to 30% more at z = 0. Similarly, the stellar mass density 
is eventually overestimated by a factor of 1.5 (1.8) at the 
largest (smallest) number densities. Finally, over the entire 
redshift range the SFR is recovered to within ~ 50% in all 
SAMs and at all number density choices. 

Compared to a constant stellar mass selection, a con¬ 
stant number density selection recovers far fewer of the true 
descendants at lower redshift. However, at all number den¬ 
sity choices, the lower redshift selections have considerably 
less contamination. 


3.2 Progenitors 

Now we take an initial selection at z = 0 and trace the 
most massive progenitors back to higher redshifts. As Fig¬ 
ure [3] shows, the recovery of individual progenitors using a 
constant stellar mass limit (detailed in Table [l]) deteriorates 
exponentially such that at z = 1, only 30% are recovered 
in the selection at the smallest stellar mass limit and less 
than 5% at the largest stellar mass limit (smallest number 
density). The sample’s contamination fraction increases im¬ 
mediately and, at all stellar mass limits, the sample is > 95% 
contamination by z = 3. 

The discrepancy between the true and observed mean 
stellar mass of the progenitors increases approximately lin¬ 
early with redshift, overestimating the mean mass by a fac¬ 
tor of three at 2 = 3, independent of stellar mass limit and 
weakly dependent on the choice of SAM. Recovery of the me¬ 
dian stellar mass is again indistinguishable from the mean 
recovery at the two largest stellar mass limits. At the two 
highest, the median mass is overestimated by factors of 3 — 7 
times. Furthermore, observed stellar mass density is increas¬ 
ingly underestimated with redshift in all SAMs. Finally, the 
mean SFR is recovered to within a factor of ~ 4 by z = 3. 
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Table 1. Constant stellar mass limits for progenitors and de¬ 
scendants, defined as the stellar mass limit for a number density 
selection at z = 0 and 2 = 3 respectively. 


SAM 

n [Mpc -3 /i 3 ] 

Descendants (z = 3) 
M lim [log M Q /h\ 

Progenitors (z = 0) 
Mum [log M @ /h] 

B06 

1 x 10“ 3 

10.32 

10.87 


5 x 10 -4 

10.51 

11.01 


1 x 10 -4 

10.76 

11.22 


5 x 10 -5 

10.82 

11.30 

D06 

1 x 10 -3 

10.31 

10.92 


5 x lO" 4 

10.46 

11.04 


1 x 10 -4 

10.68 

11.29 


5 x 10“ 5 

10.74 

11.39 

Gil 

1 x 10“ 3 

10.21 

10.84 


5 x 10 -4 

10.33 

10.97 


1 x 10“ 4 

10.52 

11.24 


5 x 10 -5 

10.59 

11.35 

G13 

1 x 10“ 3 

10.00 

10.77 


5 x 10“ 4 

10.15 

10.89 


1 x 10“ 4 

10.40 

11.12 


5 x 10 -5 

10.48 

11.21 


Selecting progenitors at a constant number density fares 
relatively better, as shown in Figure[4] Out to 2 = 3, no less 
than ~ 50% (~ 30%) of progenitors are recovered at the 
largest (smallest) selections. The mean stellar mass is re¬ 
covered to within a factor of ~ 1.5 at 2 = 3 in all SAMs and 
choices of n and the observed stellar mass density follows 
a very similar trend. Median stellar mass recovery is indis¬ 
tinguishable from the mean recovery except at the largest 
number density. Here up until 2 ~ 2 where the overestima¬ 
tion becomes larger by approximately ~ 50%. Lastly, the 
mean SFR is recovered to within ±20% at all number den¬ 
sity selections except for DOG, which overestimates the SFR 
at a peak of ~ 50% at 2 = 1. 

Comparing these results with the descendant popula¬ 
tion (S3, we find that a constant cumulative comoving 
number density selection recovers all descendent population 
properties within a factor of two of the true value. Simi¬ 
larly, all the progenitor properties are recovered to within 
a factor of 1.5 of the true value. Furthermore, a constant 
number density selection appears to trace the ensemble pro¬ 
genitor properties of 2 = 0 massive galaxies better than the 
descendants of those at 2 = 3. 


3.3 Fitting Forms 

For convenience, we fit parametric functions to the metrics 
described above for a constant number density selected sam¬ 
ple of galaxies. These fits are strictly valid over the redshift 
range 0 < 2 < 3. For each metric, we take the average 
value of all SAMs at each redshift. Next, at each redshift we 
randomly sample a Gaussian of width equal to the spread 
between SAMs, centred on the mean value. The function is 
then fit to these points. These steps are repeated 10 4 times 
to obtain the average parameters and their associated errors 
for each metric at each number density. Our detailed results 
of fitting for all number densities are reported in Table [All 



J-,-1-,-1-,-lXj-,-1-,-1-,-L_Lj- 1 -1- 1 -1- 1 -lXj- 1 -1-i-1- 1 -L. 

T-'-1- • -1-'-TTT- 1 -1---1- 1 -TXT-'-1-'-1-'-TXT- 1 -1-'-1-'-T 



—1 _ 1 _I_ 1 _I_ 1 _L±J _ 1 _ I _ 1 _I_ 1 _ L±J _1_ I _ 1 _ I _1_ L±J _ 1_ I _ 1 _I_ 1_ LJ 

0.0 1.0 2.0 3.0 0.0 1.0 2.0 3.0 0.0 1.0 2.0 3.0 0.0 1.0 2.0 3.0 

Z Z Z Z 


Figure 3. Recovery of individual progenitors, their average stel¬ 
lar mass, stellar mass density and average SFR (rows) for four 
constant stellar mass limit selections (columns) covering the num¬ 
ber density range —4.3 < log n < —3.0. The B06, D06, Gil and 
G13 models represented by solid green, dashed orange, dashed- 
dotted magenta and dotted blue lines respectively. Progenitor 
stellar mass limits for each SAM are given in the third column of 
Table [I] (A colour version of this figure is available in the online 
journal.) 


The descendant galaxy recovery and contamination 
fraction is fit with a function of form 

/ = a + b x exp (c x (1 + z )), (8) 

where a, b and c are free parameters. We find that forc¬ 
ing a to zero for the recovery fraction, and c to unity for 
the contamination fraction, gives better fits. The recovery 
of average stellar mass (tt m *) and stellar mass density (ftp*) 
are then parametrised as 

k, = a + b x (1 + z). (9) 

To fit the progenitor galaxy contamination fraction, we mod¬ 
ify the parametrisation to the form 

/' = a + b x (1 ± z) + (1 ± z) c . (10) 

Finally, we do not fit the recovery of average SFR recov¬ 
ery. Fitted parameters and associated errors are available in 
Table HU for constant cumulative number density selected 
samples. 
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Figure 4. Recovery of individual progenitors, their average stel¬ 
lar mass, stellar mass density and average SFR (rows) for four 
constant number density selections (columns) covering the num¬ 
ber density range —4.3 < log n < —3.0. The B06, D06, Gil and 
G13 models represented by solid green, dashed orange, dashed- 
dotted magenta and dotted blue lines respectively. (A colour ver¬ 
sion of this figure is available in the online journal.) 

4 INFERRED VELOCITY DISPERSION 

SELECTIONS 

In an era of ever larger and deeper spectroscopic surveys, 
the extra information these observations afford of internal 
properties could possibly be employed as a better tracer of 
progenitor or descendant galaxy properties. One product of 
such a survey is the measurement of the central stellar ve¬ 
locity dispersion of a galaxy. 

As the SAM catalogues do not report the velocity dis¬ 
persions of galaxies, we infer this quantity using Equations[6] 
and [7] The inferred velocity dispersion, as defined here, is a 
relatively direct observable at the redshifts probed because 
the stellar mass and half-mass radius are observable. In this 
section we report the results of using this property in place of 
stellar mass. As a reminder, this would be achieved observa- 
tionally by integrating the GVDF to obtain the cumulative 
number density of galaxies as a function of their velocity 
dispersion. The velocity dispersion limit, above which all 
galaxies are at a number density n, can simply be read off. 

4.1 Descendants 

As displayed in Figure[5j selecting galaxies above a constant 
inferred velocity dispersion limit, given in Table [7] results 
in slowly losing descendants with decreasing redshift in the 
B06 SAM. However, in the Gil and G13 SAMs, the recovery 


Figure 5. Recovery of individual progenitors, their average stel¬ 
lar mass, stellar mass density and average SFR (rows) at four 
constant inferred velocity dispersion selections (columns) cover¬ 
ing the range —4.3 < log n < —3.0. The B06, Gil and G13 mod¬ 
els are represented by solid green, dashed-dotted magenta and 
dotted blue lines respectively. Inferred velocity dispersion limits 
given in Table [2] (A colour version of this figure is available in 
the online journal.) 

fraction increases below z < 2. At z = 0, 90% and 60% of 
descendant galaxies are selected above the lowest and high¬ 
est inferred velocity dispersion limits respectively. As with 
selection above a constant stellar mass limit, the contamina¬ 
tion fraction increases exponentially towards lower redshift. 
At the lowest (highest) velocity dispersion limits there is sig¬ 
nificant contamination in the observed sample at the level of 
70% (90%). Recovery of the descendant mean stellar mass 
is increasingly underestimated. The true value is maximally 
underestimated at 2 = 0 at all inferred velocity dispersion 
limits by 30%. Conversely, the stellar mass density is in¬ 
creasingly overestimated with time by up to a factor of ~ 5 
times the true value. Finally, a constant inferred velocity 
dispersion selection recovers the descendants’ average SFR 
to within 10% at all limit choices and redshifts. 

In Figure[6]we show the result of selecting at a constant 
cumulative number density in inferred velocity dispersion. 
Inferred velocity dispersion is, as defined in this paper, a 
function of and proportional to stellar mass for each galaxy 
type (early and late) and so it is not surprising that the 
results are similar to those obtained in fJ3] Comparing with 
the stellar mass selections described in so these results 
suggest inferred velocity dispersion is just as competent a 
tracer as stellar mass, and even more accurate in some cases. 
However, any improvements are small over the use of stellar 
mass. 
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Figure 6. Recovery of individual progenitors, their average stel¬ 
lar mass, stellar mass density and average SFR (rows) for four 
constant number density selections (columns) covering the range 
—4.3 < log n < —3.0. The B06, Gil and G13 models are rep¬ 
resented by solid green, dashed-dotted magenta and dotted blue 
lines respectively. (A colour version of this figure is available in 
the online journal.) 


Figure 7. Recovery of individual progenitors, their average stel¬ 
lar mass, stellar mass density and average SFR (rows) at four 
constant inferred velocity dispersion selections (columns) cover¬ 
ing the range —4.3 < log n < —3.0. The B06, Gil and G13 mod¬ 
els are represented by solid green, dashed-dotted magenta and 
dotted blue lines respectively. Inferred velocity dispersion limits 
given in Table [2] (A colour version of this figure is available in 
the online journal.) 


4.2 Progenitors 

Figures [7] and [H] display the results of attempting to trace 
the progenitors of z = 0 galaxies via selection above a con¬ 
stant inferred velocity dispersion and at a constant cumula¬ 
tive number density, ordered by inferred velocity dispersion, 
respectively. Selection above a constant limit loses progen¬ 
itor galaxies from the sample with increasing redshift. By 
z = 3, only 10% of true progenitors are sampled at all ve¬ 
locity dispersion limits and SAMs. The average stellar mass 
is increasingly overestimated with redshift, by 50 — 100% 
at z = 3. The stellar mass density is increasingly underes¬ 
timated. In the B06 SAM, it is even more underestimated 
at higher inferred velocity dispersion limits than at smaller 
limits. However, in the Gil and G13 SAMs, it is underesti¬ 
mated by 70% at all limits by z = 3. Similarly, the SFR is 
recovered to within 50% in the two SAMs considered. 

Selection at a constant cumulative number density, or¬ 
dered by inferred velocity dispersion, results in recovery frac¬ 
tions similar to the number density selection using stellar 
mass. Decreasing exponentially from 2 = 0, 50% (30%) at 
the largest (smallest) number densities. Both the average 
stellar mass and the stellar mass density are recovered to 
within 40% of the true value across all redshifts, SAMs and 
number densities investigated in this paper. Finally, the SFR 
is recovered to within 20% at all times. 


Table 2. Inferred velocity dispersion limits for progenitors and 
descendants, defined as the minimum inferred velocity dispersion 
for a number density selection at z = 0 and 2 = 3 respectively. 
Inferred velocity dispersions are calculated using Equation [6] with 
the galaxy component stellar masses and sizes reported by each 
SAM. 


SAM 

n [Mpc 3 h 3 ] 

Descendants 
(Tlim [km s- 1 ] 

Progenitors 
o-lim [km s" 1 ] 

B06 

1 x 10“ 3 

133.1 

198.2 


5 x 10 -4 

181.7 

263.5 


1 x 10 -4 

309.3 

424.5 


5 x 10 -5 

365.4 

502.4 

Gil 

1 x 10 -3 

83.7 

131.8 


5 x 10 -4 

100.2 

156.9 


1 x 10 -4 

142.1 

208.9 


5 x 10“ 5 

161.6 

231.0 

G13 

1 x 10 -3 

69.3 

109.6 


5 x 10 -4 

84.7 

131.8 


1 x 10- 4 

124.7 

181.1 


5 x 10 -5 

143.8 

202.6 
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Furthermore, lower mass systems from below the selection 
at high redshift increase their stellar mass at a higher rate 
than those more massive systems and become most of the 
most massive galaxies in the local Universe. 

It is worth noting that all of the SAMs used in this work 
fail to match observed galaxy stellar mass functions beyond 
some redshift (typically z ~ 1.5) meaning that they also 

fail to reproduce the observed evolution of certain galaxy 
populations. A known problem with the original MS is the 
cosmology used. Use of a larger as than c urrently observed 
llKomatsu et al.ll201ll ; ICollaborationll2015ll will increase the 
merger rate and therefore the scatter in the rank order of 
galaxy stellar mass. Furthermore, this cosmology produces 
a larger population of quenched galaxies earlier than ob¬ 
served. This requires the SAMs to build up the low mass 
end of the GSMF at early times in o rder to ma t ch th e ob- 
served loca l stella r mass function. See lLeia et al.l (l2013l l and 
iGuo et ahl (1201 lh for an in-depth discussion into this and 
other issues. We would thus expect less scatter in the real 
Universe, and therefore better recovery of galaxy proper¬ 
ties, compared to the results ob tained here. C o smo logical 
hydrodynam i cal si mulations, e.g. iFurlong et al.l (l2014l l and 
iGenel et alJ (l2014l l. show closer agreement with observed 
galaxy stellar mass functions out to high redshift and may 
offer a better environment in which to conduct these tests. 


Figure 8. Recovery of individual progenitors, their average stel¬ 
lar mass, stellar mass density and average SFR (rows) for four 
constant number density selections (columns) covering the range 
—4.3 < log n < —3.0. The B06, Gil and G13 models are rep¬ 
resented by solid green, dashed-dotted magenta and dotted blue 
lines respectively. (A colour version of this figure is available in 
the online journal.) 


5 DISCUSSION 

Firstly, let us contrast the use of a constant stellar mass 
selected sample and a constant number density (in stellar 
mass) selected sample. As one may have expected, the for¬ 
mer recovers all descendants of an initial high redshift sam¬ 
ple. This is due to our definition of a descendant and that 
the stellar mass of systems can only increase with time inside 
these simulations. Even though the recovery fraction is high, 
the contamination fraction increases to > 80% within ~ 2 
Gyr as galaxies, initially unsampled, increase their stellar 
mass and move into the selection. 

Comparing the recovery and contamination fractions 
obtained through constant number density selections of de¬ 
scendants and progenitors, one can infer how these popu¬ 
lations have evolved. Taking the smallest number density 
choice of n = 5 x 10 -5 Mpc -3 h 3 , at z = 0 we recover 
30% of the available descendants and nearly three quarters 
of our selection is contamination. Similarly, at a = 3 we 
recover 30% of the progenitors and 70% of the sample is 
contamination. These results suggest that a large fraction 
of the progenitors of the most massive local galaxies are 
not the most massive at higher redshifts. Conversely, a large 
fraction of the most massive galaxies at high redshift are 
not among the most massive at lower redshifts. The one-to- 
one mapping in stellar mass rank order that this selection 
method assumes does not occur within these simulations. 


5.1 Where are the progenitors of 3 = 0 massive 
galaxies at high redshift? 

A simple question we can ask is where exactly in the ranking 
(in either stellar velocity dispersion or stellar mass) are the 
progenitors of the most massive galaxies at z = 0 at ear¬ 
lier times. Taking the two extremes of the number density 
choices in this work, we show in Figure O’)] the stellar mass dis¬ 
tributions of the most massive progenitors of 3 = 0 galaxies 
in the G13 SAM selected at two extreme number densities 
(top row). At each redshift we fit the stellar mass distri¬ 
butions with a Gaussian function and show in the middle 
panels that fit residuals are minimal (< 5%). This is done 
to quantify the changes in these distributions as a function 
of redshift. In the bottom row, we show how the properties 
of these distributions change with redshift. The mean and 
widths (distribution standard deviation) are shown in the 
bottom left and bottom right panels respectively. 

We find that at higher redshift, the mass distributions 
move systematically towards smaller mass galaxies and the 
stellar mass distribution widens. At the highest redshift, the 
distributions are found to have standard deviations of a = 
0.32 [log Mq h _1 ] and a = 0.21 [log JV/q h -1 ] for the largest 
and smallest number density selections. In both cases, the 
distributions increase in width by a factor of ~ 3 since 3 = 0. 

Furthermore, the selections made by a constant number 
density (the mass limits of which are indicated by the ver¬ 
tical dashed lines in the top panels of Figure 0 show that 
beyond 3 > 1, the majority of the progenitors are below this 
limit (i.e. the the peak of the actual progenitor stellar mass 
distribution is found at a lower stellar mass than the selec¬ 
tion’s stellar mass limit). Therefore, within the Millennium 
Simulation at the very least, it can be said the progenitors of 
the most massive local galaxies are not only the most mas¬ 
sive galaxies at higher redshifts - they span a wide range of 
masses at higher redshifts. For example, the most massive 
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progenitors span more than an order of magnitude (> 1 dex) 
in stellar mass for the largest initial number density selec¬ 
tion (top right panel) and ~ 0.5 dex for the smallest number 
density at z = 3. As shown in this work however, this does 
not appear to significantly impact the ability of a number 
density selected sample to recover the average stellar mass, 
star-formation rate or stellar mass density. 

5.2 How do mergers affect our selections? 

The number of progenitors or descendant galaxies can 
change over time due to mergers between objects within 
the initial sample. Selecting galaxies at a constant number 
density ignores these changes, and potentially contributes to 
the over- or underestimation of ensemble properties. 

To determine the extent of this, we calculate the number 
of mergers between the descendants of an initial selection at 
z = 3 in the B06 and G13 SAMs, as these are based on dif¬ 
ferent dark matter merger trees from the MS. In B06, 1.3% 
and 15.5% of galaxies in our initial selection are lost due to 
mergers from z = 3 to z = 0 at the smallest and largest 
number density selections. In the G13 SAM however, these 
measurements are higher at 10.9% and 29.6%, respectively. 
For the most massive galaxies, this translates to approxi¬ 
mately 3—5 mergers per massive galaxy (see Table[l]for mass 
limits) over the redshift range 0 < 2 < 3. It must be noted 
that these numbers represent all (total) mergers, and are not 
major mergers as they may include some mergers with mass 
ratios greater than 1 : 4, the most widely used definition. 
These measurements are slightly higher compared with pair 
fraction and morphological observations of major m ergers 
in comparably massive galaxies (see, e .g. jBluck et alj|200gl : 
IConselice et al]|2009t iMan et aPl2012li . 

At increasingly larger number densities, mergers within 
the descendant population may become increasingly impor¬ 
tant. As such, selection at a constant number density may 
not be applicable over such a redshift range. It may be ap¬ 
propriate to correct the number density between redshift 
bins to account for mergers that have occurred within the 
sample. However, the reduction of the number density in 
response to descendant galaxy mergers does not result in 
the desired effect. Qualitatively, reducing the number den¬ 
sity at each redshift results in higher stellar mass limits. The 
average stellar mass of the observed sample would therefore 
increase. As the average stellar mass and stellar mass density 
are already overestimated, this discrepancy would only in¬ 
crease. On the other hand, the ‘un-merging’ of galaxies going 
backwards in time would increase the number density used 
to trace progenitor galaxies. This would lower the stellar 
mass limit used to select the samples and thus decrease the 
measured average stellar mass of the observed samples. As 
this quantity is also overestimated, this discrepancy would 
be reduced. However, as we have increased the number den¬ 
sity contamination would also increase. 

5.3 Can we infer velocity dispersion in a 
semi-analytical model? 

We investigate inferred velocity dispersion in place of stellar 
mass as a ranking property due to evidence of a shallower 
and more stable evolution with redshift. It is prudent to 


ask whether this quantity can be accurately obtained from 
the SAMs used in this work. Use of Equation [6] implicitly as¬ 
sumes a spherically symmetric system and would correspond 
to a system with a Sersic index of n ss 5.5 llCappellari et aH 
l2006h . However, the factor in the denominator doesn’t ac¬ 
count for multiple components (i.e. a bulge and disk) and 
is influenced not only by Sersic index but also galaxy black 
hole mass. Thus the value calculated using this equation is 
a simplistic estimate at best and not strictly applicable to 
every type of galaxy. Furthermore, disk-dominated systems 
are not spherical and isotropic and thus this equation is not 
strictly applicable to these types of systems. 

Using the bulge-to-total stellar mass ratio (B/T) as a 
proxy for disk and bulge dominated morphologies, we find 
that the most massive galaxies at z = 0 (z = 3) in the B06 
SAM are typically bulge dominated with only 30% (40%) 
having B/T < 0.5. While this suggests Equation[6]is appli¬ 
cable at these redshifts, this SAM does not reproduce ob¬ 
servations^ larger diskMominatedfractions at high redshift 
llBluck et al.ll2014 : lBruce et al .1120141 ). The G13 SAM repro¬ 
duces observations more closely with 50% (95%) of systems 
having B/T < 0.5 at z = 0 (z = 3). Because of this, the 
velocity dispersions inferred within this SAM at the highest 
redshifts probed can be considered discrepant with observa¬ 
tions only at the highest redshifts. While these caveats must 
be taken into consideration, the values of velocity dispersion 
inferred are physical and generally in a greement with obser¬ 
vations of sphero idal/passive systems ( Bernardi et al.|[20ld : 
lOser et akll2012| h with 1.8 < logcr* [km s -1 ] < 2.7 depend¬ 
ing on the SAM. 

We must also consider whether the physical sizes of 
the simulated galaxy components can be used to infer the 
velocity dispersion. In Gil and G13, the resulting mass- 
size relations ar e shallower than the obse rvations (see, e.g., 
lLani et~ak I l2013l : Ivan der Wei et aT1l2014l ) with both masses 
and s izes larger (smaller) at low (high) redshift (iGuo et al.l 
fsoTTh - For the purposes of this work however, we only care 
if the evolution in M/R is represented correctly, and it is 
possible that, at least within Gil and G13, this may not be 
the case. We conclude that inferred velocity dispersion could 
provide a useful property with which to trace the evolv¬ 
ing properties of the most massive galaxies. However, more 
detailed future simulations that accurately reproduce the 
evolution in both stellar mass and galaxy component sizes, 
or that report a value for velocity dispersion directly, are 
needed to confirm these findings. 

5.4 Comparison with previous works 

Our results are consistent with the work of lLeia et al.1 (|2013l l 
who investigate cumulative number density selection of de¬ 
scendant galaxies over 0 < z < 3 in the range 0.5 < 
n [10 -4 Mpc -3 ] < 8.0 using the Gil SAM. Uncorrected 
for mergers and growth scatter, they show that for the two 
smallest number densities, the median stellar mass evolution 
is overestimated by between 0.05 — 0.15 dex (12 — 41%) by 
z = 0. Using the mean stellar mass, we find an overestimate 
at n = 5 x 10 ~ 5 Mpc~ 3 of 40% in the Gil and G13 SAMs. 
Most recently, iHenriaues et al.l (120141 . H06) contrasted the 
mean stellar mass of progenitors derived from a constant 
number density selection with the value s obtained from 
their SAM using a Planck dAde et al.ll2014l ~) cosmology. They 
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Figure 9. Progenitor mass distributions in the I Guo et al.l (* 201 .‘1 1 SAM for z = 0 galaxy selections at constant number densities 
log n = —4.3 Mpc -3 h 3 (top left) and log n = —3.0 Mpc -3 h 3 (top right). These number densities correspond to galaxies with a 
2 = 0 stellar mass of log M * > 11.21 [Mq h -1 1 and log M* > 10.77 [Mq h~ 1 ], respectively. Stellar mass distributions at 2 = 0,1,2,3 
are given in black, red, orange and green solid lines. Dashed vertical lines represent the stellar mass cuts inferred from a number density 
cut at each redshift. Residuals from Gaussian fits to these distributions are displayed in the middle panels. Parameters from Gaussian 
fits to the progenitor masses are shown in the bottom row, with the mean stellar mass and standard deviation at bottom left and bottom 
right respectively with the largest (blue squares) and smallest (red circles) number densities plotted as a function of redshift. (A colour 
version of this figure is available in the online journal.) 


found that mean stellar mass evolution is overestimated by 
a factor of 3 — 5 for the progenitors of galaxies with a 2 = 0 
stellar mass between 10.25 < log M* [Mq h -2 ] < 11.25. 
This is a larger increase than we find for similar mass galax¬ 
ies. We suggest this discrepancy is due to the ability of HOG 
to correctly reproduce the abundance of massive galaxies out 
to higher redshift. With less massive and passive galaxies, 
HOG must produce a larger evolution in stellar mass from 
high redshift to match the local stellar mass function. 


6 SUMMARY 

We have compared the use of two popular galaxy selection 
methods and contrasted the use of galaxy stellar mass and 
inferred velocity dispersion in semi-analytical models based 
on the output of the Millennium Simulation over the redshift 
range 0 < z < 3. We select galaxies above a constant limit 
of stellar mass in this redshift range, and conclude that: 

• Descendants can be fully recovered over the entire red¬ 
shift range regardless of mass limit choice. However, pro¬ 
genitors of z = 0 galaxies are lost from the selection with at 
least 80% below the stellar mass limit at 2 = 1. 

• Average descendant ensemble stellar mass is increas¬ 
ingly underestimated with increasing redshift by an amount 


that varies between SAMs but is, on average, around 50% 
at 2 = 0. Similarly, the average stellar mass of progenitors 
is increasingly overestimated. At 2 = 0, average progenitor 
stellar mass is overestimated by a factor of ~ 5. 

Additionally, selecting galaxies at a constant cumulative 
number density in stellar mass, we conclude that: 

• Recovery of individual descendant galaxies falls expo¬ 
nentially with a time scale dependent on choice of number 
density. Just 30% of the most massive galaxies (selected at 
log n = —4.3 Mpc -3 h 3 at 2 = 3) are at the same cumu¬ 
lative number density at the lowest redshift. For the largest 
number density selection, this increases to 60%. Recovery of 
progenitors is similar, but with 50 — 30% recovered at the 
highest redshifts, depending on choice of number density. 

• The average stellar mass of descendants is overesti¬ 
mated by 15% (70%) at the highest (smallest number densi¬ 
ties) by 2 = 0, increasing linearly from 2 = 3. Furthermore, 
independent of number density, progenitors’ average stellar 
mass is overestimated by ~ 50% at the highest redshift. 

Finally, we have investigated inferred velocity disper¬ 
sion as a property with which to trace galaxies over the 
same redshift range. We have found that a constant number 
density in velocity dispersion recovers average stellar mass, 
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stellar mass density and average SFR to within ±50% for 
both descendants and progenitors. Furthermore, selecting 
galaxies at a constant velocity dispersion limit recovers the 
aforementioned properties to within ±80% of the true val¬ 
ues. However, these results are based on inferring velocity 
dispersion of galaxies which may not be strictly applicable 
to some SAMs and redshift ranges. Further study is required 
into velocity dispersion through dedicated simulations that 
predict these properties. 

In conclusion, selecting galaxies at a constant cumula¬ 
tive number density is found to trace the true evolution of 
average stellar mass and the average SFR of the progenitors 
and descendants of galaxies in initial selections at z = 0 and 
z = 3. However, it does not trace the exact same galaxies but 
rather galaxies with very similar properties. Furthermore, it 
is found that selecting galaxies above a constant stellar mass 
with redshift returns the actual evolution within a larger fac¬ 
tor of between two and thirty. 
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APPENDIX A: FITTING PARAMETERS 

Here we present tabulated fitting parameters for descen¬ 
dant and progenitor galaxy populations, as described in 43.31 
These are obtained by taking the mean of individual SAM 
data points and taking the variance between SAMs as the 
lcr of a Gaussian distribution on each data point. Fitting 
is then processed on randomly sampled values 10 4 times to 
calculate the most likely fitting parameters and their asso¬ 
ciated uncertainties. 
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Table Al. Fitting parameters, described in 5331 for descendants and progenitors selected at a 
constant cumulative number density across the redshift range 0 < z < 3. 


Metric 


Descendants 



Progenitors 


a 

b 

c 

a 

b 

c 




n = lx 10 -3 

[Mpc -3 h 3 ] 



free 

/contair^^ 

ftp* 

0.000 ± 0.000 
0.511 ± 0.019 
0.191 ± 0.066 
0.592 ±0.050 

0.480 ± 0.030 
-0.010 ±0.001 
-0.044 ±0.019 
-0.156 ±0.015 

0.175 ± 0.018 
1.000 ± 0.000 

0.000 ± 0.000 
-0.708 ± 0.070 
-0.131 ± 0.030 
-0.132 ± 0.031 

1.212 ± 0.055 
-0.235 ± 0.056 
0.125 ± 0.017 
0.126 ±0.018 

-0.260 ±0.027 
0.563 ±0.064 




n = 5 x 10 -4 

[Mpc" 3 h 3 ] 



free 

,/contam 

Hm* 

Kp* 

0.000 ± 0.000 
0.568 ±0.032 
0.294 ±0.070 
0.596 ±0.061 

0.372 ± 0.044 
-0.011 ±0.001 
-0.071 ±0.020 
-0.156 ±0.018 

0.240 ± 0.032 
1.000 ± 0.000 

0.000 ± 0.000 
-0.670 ± 0.035 
-0.134 ±0.058 
-0.134 ±0.058 

1.234 ±0.057 
-0.242 ± 0.017 
0.134 ±0.036 
0.134 ±0.036 

-0.302 ±0.029 
0.580 ±0.009 




n = lx 10 -4 

[Mpc" 3 h 3 ] 



free 

,/contam 

Kp* 

0.000 ± 0.000 
0.725 ±0.050 
0.691 ±0.099 
0.742 ±0.062 

0.180 ±0.047 
-0.014 ±0.001 
-0.151 ±0.029 
-0.194 ±0.018 

0.430 ± 0.071 
1.000 ± 0.000 

0.000 ± 0.000 
-0.623 ± 0.036 
-0.107 ±0.072 
-0.107 ±0.073 

1.516 ±0.155 
-0.216 ± 0.019 
0.144 ±0.045 

0.145 ± 0.046 

-0.519 ±0.079 
0.582 ±0.005 




n = 5 x 10“ 5 

[Mpc -3 h 3 ] 



free 

,/contam 

t^m* 

ftp* 

0.000 ± 0.000 
0.787 ±0.054 
0.747 ±0.136 
0.868 ± 0.093 

0.128 ±0.044 

-0.014 ±0.001 
-0.198 ±0.041 
-0.228 ±0.028 

0.522 ± 0.096 
1.000 ± 0.000 

0.000 ± 0.000 
-0.621 ± 0.058 
-0.083 ± 0.062 
-0.085 ± 0.064 

1.723 ± 0.321 

-0.204 ± 0.026 
0.139 ± 0.036 
0.141 ± 0.037 

-0.626 ±0.144 

0.582 ±0.009 


a Descendants use Equation \&\ and progenitors use Equation HOl for fitting. 
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